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DATA DEPENDENT SCRAMBLER WITH IMPROVED GLOBAL CONSTRAINT 



CROSS-REFERENCE TO RELATED APPLICATIONS 
[OdOi] This application claims the benefit of U.S. Provisional 
Application No. 60/509,356. filed on October 7, 2003, which is hereby 
incorporated by reference in its entirety. 

FIELD OF THE INVENTION 
[0002] The preisent invention relates to data coding in a 
communications channel, and more particularly to data coding that reduces the 
huriaber of consecutive zeros in a. communications channel. 

BACKGROUND OF THE INVENTION 
[0003] Magnetic storage systenns such as disk drives . include a 
magnetic medium or platter with a magnetic coating that Is divided into data 
tracks. The data tracks are divided into data sectors that store fixed-sized data 
blocks. A read/write head typically includes a write circuit and' write element such 
as an inductor that selectively generates positive and negative magnetic fields 
that are storied by the magnetic medium. The stored positive and negative fields 
represent binary ones and zeros. The read/write head includes an element such 
as a magneto-resistive elertient that senses the stored magnetic field to read 
data from the magnetic medium. A spindle motor rotates the platter and an 
actuator arm positions the read/write head relative to the magnetic medium. 

I:\MP0336VAPP_MP0336_MDW_1 1_13_03.doe 1 
Express Mail Label No. ELe23307S38US 



Marvell Ref. No. MP0336 •> . PATENT 

[0004] Magnetic storage systems typically code the user data 
sequence or user data using Run Length Limited (RLL) code. RLL coding 
reduces sequences in the user data that may cause problems with timing circuits 
of the magnetic storage system. For example, a RLL code enforces constraints 
on the number of consecutive ones and/or zeros that are allowed to occur in the 
data, the efficiency of the RLL code is typically measured by a code rate. For 
every m-bits or m-bytes of user data, an n-bit or h-byte encoded word is written 
to the storage media. RLL codes are used to eliminate unwanted bit patterns in 
the data and typically do not have enror correction capability. RLL coding, 
however, reduces data storage capacity by increasing channel bit density (CBD), 
which reduces a signal to noise ratio (SNR) and may lead to lower reliability. 

[0005] Refening now to FiG. 1, a write-path of. a data storage system 
with RLL coding is shown. A host bus interface (HBI) 14 receives user data from 
a host computer 16. A buffer manager (BM) 18 initially stores the user data in a 
buffer 20 and then sends the user data from the buffer 20 to a disk formatter (DF) 
22 with proper timing. An ECC/CRC encoder 24 appends CRC bits to the user 
data. The CRC bits are calculated based on the user data. 

[0006] ECC bits are computed based on the user data and the CRC 
bits and. appended thereto. A scrambler 26 generates a pseudo-random 
scrambling sequence that is based on a polynomial and seed. The user data 
and the scrambling sequence are input to an XOR gate 27, which outputs 
scrambled user data. A RLL encoder 28 is used to constrain the unwanted bit 
patterns in the scrambled user data sequence. 
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[0007] To increase SNR and data storage capacity, data storage 
systems were developed without RLL coding using data-dependent scramblers 
|(DDS). Data is processed by the DDS to eliminate unwanted bit patterns. The 
DDS is disclosed in "Improved Data Coding For Enforcing Constraints On Ones 
and Zeros In a Communications Channel", U.S. Patent Application Serial No. 
10/423.552, filed April 25, 2003, which is commonly assigned and is hereby 
Incorporated by reference in its entirety. The scrambled user data from the DDS 
is fonwai-ded to an ECC/CRC encoder, which generates and appends CRC and 
ECC bits to the DDS coded user data. 

[0008] Because the CRC and ECC bits may also contain unwanted bit 
patterns, traditional RLL coding is used to encode the ECC/CRC portion but not 
the user data. The data storage systerh is still l-efenred to as being without RLL 
coding because the CRC and/or ECC bits are relatively small in number as 
compared to the number of bits in the user data. Refemng now to FIG. 2. a write- 
path of a data storage system that includes the DDS is shown. The host computer- 
IB generates user data that is transfen-ed to the host bus interface (HBI) 14. The 
buffer riianager (BM) 18 receives the user data from the HBI 14 and controls the 
data flow through the write path. The user data is periodically stored in the buffer 
20. The BM 18 passes thie user data on based on sequence timing of data flow 
through the write path. The disk fomnatter (DF) 22 receives the user data from the 
BM 1 8 and formats the user data. 

[0009] The DDS 40 receives the user data from the DF 22 and, 
scrambles the user data to eliminate the unwanted bit patterns. An ECC/CRC 
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encoder 42 appends CRC bits that are calculated based on the scrambled user 
data. ECC bits are also computed based on the scrambled user data and the CRC 
bits. The ECC bits are also appended to the scrambled user data. The DDS 40 
ensures that the miaximum number of consecutive zeros (or the Global constraint or 
G-constraint) is at most 18 when the symbol size M is equal to 10. In some 
circumstances, It is desirable to improve the Global constraint. 

SUMMARY OF THE INVENTION 
[0010] Accordingly, the present invention provides a data dependent 
scrambler for a communications channel that receives a user data sequence 
including N symbols each with M bits. The data dependent scrambler includes a 
seiBd finder that selects a scrambling seed and a first scrambler that receives 
said usei- data sequence and said scrambling seed from said seed finder. The 
firet scrambfer generates a scrambled user data sequence. A first encoder 
identifies a string of X consecutive zeros in adjacent symbols of the scrambled 
used data sequence and replaces one of the adjacent symbols with an all-one 
symbol. The first encoder replaces the other of the adjacent symbols with first 
bits representing a position of the string of X consecutive zeros and second bits 
representing bits of the adjacent symbols that are not in the string of X 
consecutive zeros. 

[0011] In other features, ,X is greater than M. Both of the adjacent 
symbols are hot all-zero. The seed finder selects the scrambling seed based 
upon the symbols in the user data sequence. 
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[0012] In other features, X is equal to 12 and M is equal to 10. The 
start position is identified using Y least significant bits of the second symbol. Y is 
equal to 3. 

[0013] In still other features, the data dependent scrambler further 
includes an H-code token finder that generates an H-code token that is 
dependent upon the symbols in the user data sequence. An H-code encoder 
receives the scrambled user data sequence frohri the scrambler and increases a 
Hamming weight of the scrambled user data sequence using the H-code token. 
The H-code encoder outputs the scrambled user data to the encoder. The data 
dependent scrambler is Inriplemented in a write path Of a data storage system. 

[0014] Further areas of applicability of the present invention will 
become apparent from the detailed description provided hereinafter. It should be 
understood that the detailed description and specific examples, while indicating 
the prefen-ed embodiment of the invention, are intended for purposes of 
illustration only and are not intended to limit the scope of the invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0015] The present invention will become more fully understood from 
the detailed description and the accompanying drawings, wherein: 

[0016] FIG. 1 is a functional blocl< diagram of a data storage system 
with RLL coding according to the prior art; 

[0017] FIG. 2 is a functional block diagram of a data storage system with 
a data dependient scrambler (DDS) coding according to the prior art; 

[0018] FIG. 3 is a functional block diagram of a write path of a 
comniunications channel according to the present invention and corresponding 
user data; 

[0019] FIG. 4 is a functional block diagram of a read path of the 
communications channel according to the present invention and corresponding 
user data; 

[0020] FIGs. 5A and 5B are functional block diagrams of data 
dependent scramblers (DDS) in the communication channel for the write path 
and read path, respectively, according to the present invention; 

[0021] FIGs. 6A and 6B illustrate G-constraint encoding according to 
the present invention; 

[0022] FIG. 7 is a flowchart illustrating G-constraint encoding along the 
write path of the communications channel; and 

[0023] FIG. 8 is a flowchart illustrating G-constraint decoding along the 
read path of the communications channel. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0024] The following description of the preferred embodiments is 
merely exemplary in nature and is in no way intended to limit the invention, its 
application, or uses. For purposes of clarity, the same reference numbers will be 
used In the drawings to identify similar elements. 

[0025] While the present invention will be described in the context of a 
data storage system, skilled artisans will appreciate that the present invention 
can be applied to any binary communications channel with constraints on the 
number of consecutive ones or zeros. As will be described further below, the data 
storage system according to the present Invention does not employ RLL coding on a 
user data portion. The present invention discloses a codinig technique that eliminates 
unwanted bit pattems with a smaller reduction in data storage capacity as confipared 
to RLL coding. In other words, the coding technique according to the present 
invention reduces the channel bit density (CBD) less than data storage systertis 
using RLL coding on the user data. As used herein, the term data dependent 
scrambler (DDS) is defined as a scrambler that alters at least one of a selected 
sccambler, a generating polynomial, a seed, and a scrambling sequence based 
upon current user data that is to be scrambled. 

[0026] The DDS according to the present invention selects a 
scrambling seed that is dependent upon the symbols in the user data sequence. 
The DDS optionally perfomis coding to improve the Hamming weight of the 
scrambled user data sequence. Instead of perfomriing coding to improve the 
nurnber of zeros in interleaved subsequences, the present Invention performs 
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coding to improve the G-cx)nstraint. For example, for M=^0, prior approaclies 
produce G=18 and the present invention produces G=1 1 . 

[0027] Refem'ng now to FIG. 3, a write path 50 of a communications 
channel is illustrated with a coiresponding user data description that is generally 
identified at 52. A host computer 54 outputs user data to a host bus interface (HBI) 
56. Although not illustrated, other elements of the write path 50 may also include 
the buffer rtianager 18, the buffer 20, and the disk fortnatter 22 as shown in FIGs. 1 
and 2. A data dependent scrartibler (DDSyencoder 60 receives the user data from 
the HBI 56 and processes the user data to remove unwanted bit pattems. The 
scrambled user data finom the DDS/ehcoder 60 is procisssed by an ECC/CRC 
encoder 62. which appends CRC bits that are calculated based on the scrambled 
user data and ECC bits that are calculated based on the scrambled user data and 
the CRC bits. A RLL encoder 64 encodes the ECC and CRC bits to remove any 
undesired bit pattems In these portions. The scrambled user data and the RLL bits 
are passed on to a vyrite circuit. 

[0028] Refenirig now to FIG. 4, components of a read path 66 of the 
communications channel are illustrated with a con-esponding user data description 
that is generally identified at 68. The read path 66 transfers data from the data 
storage system to thie host computer 54. Although not illustrated, other elements of 
the read path 66 can Include the buffer mahager 18, the buffer 20, and the disk 
formatter 22 shown in FIGs. 1 and 2. An RLL decoder 70 decodes the RLL code 
that was generated based on the ECC and CRC bits. An ECC/CRC decoder 72 
receives the output of the RLL decoder 70 and attempts to con-ect all possible enors 
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in the user data. A DDS/decoder 74 reverses the scrambling and encoding that 
was performed by the DDS/ehcoder60. 

[0029] Refenihg now to FIG. 5A, the DDS/encoder 60 is shown in further 
detail and includes a data buffer 76 that receives the user data and coordinates data 
transfer timing through the DDS/encoder 60. A seed/code finder 78 generates a 
scrambling seed (S) and an H-code token (H) as described in further detail below. A 
first exclusive OR (XOR) gate 80 perfomis a bit-wise XOR operation on the user 
data and the scrambling seed (S) from the seed/code finder 78 to generate a 
scrambled user data sequence. 

[0030] An H-<x)de encoder 82 receives the scrambled user data 
sequence from the XOR gate 80 and the H-code token H and encodes the 
scrambled user data sequence to improve the worst-case Hamming weight thereof, 
as will be described further below. A G-constraint.encoder 84 receives the encoded 
and scrambled data sequence and encodes the data sequence to limit the 
maximum length of a run of zeros in the sub-sequences, as will be described 
further below. A pi-ecoder 86 optionally transfers all data in the interleaved 
nonretum-to-zero-inverse (INRZI) or nonretum-to-zero-inverse (NRZI) domain into 
the nonretum-to-zero (NRZ) domain. 

[0031] The operation of the DDS/encoder 60 will be described with 
reference to FIG. 3 and 5A. In this description, an exemplary M-bit symbol data 
sequence (D = {Dn-i. Dn.2. .... Do}) of size N is used, where M = 10 (I.e., 10-bit 
symbol data sequence). As can be appreciated, any sequence length or symbol 
size can be used. The user data (D) is initially sent to the data buffer 76 and the 
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seed/code finder 78. The user data is output to the first XOR 80, The delay of the 
data buffer 76 is preferably sufficient to allow the scrambling seed S and H-code 
token H to be generated by the seed/code finder 78. 

[0032] Given the exemplary data sequence D, there are 1023 non-zero 
10-bit symbols. More specifically, if the number of bits in the symbol is M, it is 
always possible to find a seed S if N < 2"^. This is because not all of the 2^ different 
possible symbols can . be included in a data sequence of fewer than 2'*" symbols. 
Assuming that the user data includes 4096 bits that are organized into 10-bit 
symbols, there are at least 203 (1023 - 2 * 410) non-zero symbols that are different 
from any of the data symbols and their complements (i.e., bit-wise inversion). More 
particularly, the seed/code finder 78 preferably excludes symbols with a 
Hamming weight of one. S and H are selected such that they are different from 
any of the user data symbols and their complements. H is also not the 
complement of S. S is transmitted to the first XOR gate 80 and S and H are 
transmitted to the H-code encoder 82. 

[0033] A scrambling sequence {S, S S} is fomned by repeating the 

scrambling seed S N times. The first XOR gate 80 perfomis bit-wise XOR of the 

scrambling sequence {S, S S} with the data sequence D = {Dn-i. Dn-2. .... Do} to 

obtain a scrambled sequence 0 = {Cn-i, Cn.2 Co}. The scrambled data sequence 

(C = {Cn-i, Cn.2, .... Co}) does not contain either an all-zero symbol or an all-one 
symbol. At a minimum, each symbol includes one "1" and nine "0"'s, which provides 
a worst-case Hamnhing weight of one (or 10% for M=10) for the scrambled user data 
sequence. 
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[0034] To Improve the worst-case Hamming weight of the scrambled user 
data sequence C, the H-code encoder 82 uses a first token (tokeni) and its 
complement (token2) to perform additional coding. The H-code decoder 92 that 
is associated with the DDS/decoder 74 on the read path 66 reverses the H-code 
encoding. The H-code encoder 82 generates tokeni by calculating S XOR H. 
Since H is different from any of the user data symbols and their complements, it 
follows that both tokeni and tokerta are different from any of the scrambled data 
symbols of the set C. The H-code encoder 82 appends both S and H to the H- 
coded data sequence. Both S and H are processed by the G-constraint encoder 
84 along with the H-coded scrambled data symbols. S' and H' correspond to S 
and H after the G-constraint encoding described below. 

[0035] The H-cpde encoder 82 maps M bjt symbols to less than M bit 
symbols (for example, 10-bits to 6-bits using a look-up table) in certain 
circumstances that are described below. The 6-bit patterns have a Hamming 
weight of at least weight two. The 10-bit to 6-bit map can be stored in a look-up 
table. There are ten symbols with a Hamming weight one and forty-five symbols 
having a Hamming of weight two. In total, there are fifty-five 10-bit symbols to 
convert using the table. Further, there are fifteen weight twb 6-bit patterns, 
twenty weight thrise 6-bit patterns, fifteen weight four 6-bit patterns, six weight 
five 6-blt patterns and one weight six 6-bit pattern. Therefore, the fifty-five weight 
one and weight two 10-bit symbols can be converted into fifty-five of the fifty- 
seven possible 6-bit symbols of weight two or greater. This enables a one-to-one 
correspondence that is both static and reversible. 
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[0036] The H-code encoder 82 processes the scrambled symbol set C 
two symbols at a time. If the total Hamming weight of the two symbol group is at 
least four, the data Is passed on without change. If the total Hamming weight of 
the two symbol group is two or three (i.e., [1,1] or [1,2]), a symbol replacement 
occurs. The H-cdde encoder 82 inserts tokeni for the symbol on the left (i.e., the 
first symbol of the two-symboj set). For the symbol on the right (i.e., the second 
symbol of the two-symbol set), the first four bits of the 10-bit symbol are used to 
indicate the position of the "1" in the symbol having the Hamming weight of one. 
The second symbol Is converted Into a 6^bit symbol using the table described 
above. If the Harriming weight of the two-symbol group is three with [2,1], the H- 
code encoder 82 inserts tokena on the left. The H-code encoder 82 uses the first 
four bits of the lO^blt symbol to indicate the position of the "1" in the Hamming 
weight one symbol. The second symbol is converted into a 6-bit symbol using 
the table described above. A more detailed description of the operation and 
function of the H-code encoder 82 is provided in co-pending U.S. Patent 
Application Serial No. 10/693,796, entitled "Methods and Apparatus for Improving 
Minimum Hamming Weights of a Sequence", filed on August 12, 2003, which Is 
commonly assigned and which is incorporated herein by reference in its entirety. 

[0037] The H-coded scrambled data sequence does not contain either 
the all-zero symbol or the all-one symbol. As a result, there are at most eighteen 
consecutive zeros In the H-coded data sequence (e.g., G=18 for M=10). The G- 
constraint encoder 84 further reduces the maximum length of a run of zeros. 
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[0038] Before using the G-constraint encoder 84, the G-constraint is 
equal to 18 for M=^0 because the H-coded data sequence contains neither the all- 
zero symbol nor the all-one symbol. Therefore, the worst case is when a symbol 
1000000000 is followed by a symbol 0000000001. The G-constraint encoder 84 
significantly improves the G-cohstraint from G=1 8 to G=1 1 . 

[0039] Refemng now to FIG. 5B, the DDS/decoder 74 on the read path is 
shown. The DDS/decoder 74 includes a G-constraint decoder that reverses the 
coding that was peirfontied by the G-constraint encoder 84, as will be describiBd 
below. The output of the G-constraint decoder 90 is input to a H-code decoder 92, 
which riBverses the encoding that was performed by the H-code encoder 82 and 
outputs a seed (S) and a scrambled user data sequence to an XOR gate 94. The 
XOR gate outputs the user data sequence. 

[0040] Referring now to FIGs. 6A and 68, two adjacent symbols are 
shown. For any two adjacent symbols with a string of 12 consecutive zeros, the last 
zero position of this string cannot be position 9 of the second symbol. Otherwise, 
the second symbol would need to be an all-zero symbol, which cannot be output by 
the H-code encoder. For the same reason, the start position of the zero string 
cannot be the zero position of the first symbol. 

[0041] Since the string of zeros has 12 zeros or more, the possible start 
positions of the zero string are limited to positions 1, 2, 3, 4, 5, 6, and 7. Therefore, 
only three bits are needed to indicate the start position of a zero string having a 
length of at least 12 bits for M=10. In addition, the bit that is located in a position 
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that immediately precedes tlie zero string must be a 1. Therefore, there are 20-12-1 
= 7 bits that heed to be recxDrded when M = 1 0. 

[0042] The G-cdnstraint encoder 84 detemiines whether there is a string 
of zeros having a length that is greater than or equal to 12 in any two consecutive 
symbols. If true, the first symbol is replaced by the all-one symbol. The second 
symbol is replaced by the following: bit 0 to bit 6 are used to record the remaining 
seven bits. Bit 7 to bit 9 are used to indicate the start position of the zero string. 
Note that if the length the zero string is more than 12, the IS*" zero and subsequent 
zeros are recorded using bit 0 to bit 6. 

[0043] One example implementation of the G-constraint code is shown in 
FIG. 6B. Note that the last three bits of the second symbol after G-constfaint coding 
must contain at least a single 1. Therefore, the maximum number of zeros in the 
second G-constraint coded symbol and the next symbol is (2+9) = 11. In other 
words, the worst case occurs when the second symbol has a 1 in bit position 7 and 
the subsequent symbol in the user data is 0000000001 . As a result of the G-code 
encoding, the global constraint (G) is reduced. In the case of a 10-bit symbol, 
the DDS achieves a G = 1 1 and a minimum Hamming weight of 20%. 

[0044] As can be appreciated, the G-constraint encoder is incremented 
one symbol at a time rather than two symbols at a time. If G-constraint encoding 
is performed on two adjacent symbols, such as dO and d1, it will not be 
performed on d1 and d2 because, by definition, the least significant bits of d1 will 
contain a one which will prevent d1 and d2 from having a run of 12 consecutive 
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zeros. However, if G-constraint encoding is not perfomned on dO and d1, it can 
be performed on d1 and d2. 

[0045] Referring now to FIG. 7, steps that are performed by tlie G- 
constraint encoder 84 are shown in further detail. In step 200, the G-constraint 
encoder 84 initializes BO=dO, which is the first symbol, and B1=d1. which is the 
second symbol that is adjacent to the first symbol. In step 202, control 
detemnines whether there is a string of 6's of length greater than 11 in the 
syrhbols BO and B1. If tme, control performs G-constraint encoding on BO and 
B1. In step 206, control outputs BO. If step 202 is false, control also continues 
with step 206. In step 208, control determines whether there is additional data to 
process. If true, control sets BO = 81 and B1 = new symbpl in step 210 and 
conti-bl continues with step 202. If step 208 is false, control outputs B1 in step 
212 and control ends. 

[0046] Referring now to FIG, 8, steps that are performed by the G- 
cbnstraint decoder 90 in FIG. 5B are shown in further detail. In step 220, the G- 
code decoder 90 initializes BO=dO, which'is the first symbol, and B1=d1, which is 
the second symbol that is adjacent to the first symbol. In step 222, control 
determines whether symbol BO is an all-one symbol. If true, control performs G- 
constraint decoding on BO and B1 . In step 226, control outputs BO. If step 222 is 
false, control also continues with step 226. In step 228, control determines 
whether there is additional data to process. If true, control sets BO = B1 and B1 
= new symbol in step 230 and control continues with step 222. If step 228 is 
false, control outputs B1 in step 232 and control ends. 
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[0047] Those skilled in the art can now appreciate from the foregoing 
description that the broad teachings of the present invention can be implemented 
in a variety of forms. Therefore, while this invention has been described in 
connection with particular examples thereof, the true scope of the invention 
should not be so limited since other modifications will become apparent to the 
skilled practitioner upon a study of the drawings, the specification and the 
following claims. 
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